Linear prediction modulation filtering for speaker recognition of reverberant speech

نویسندگان

Bengt J. Borgstrom

Alan McCree

چکیده

This paper proposes a framework for spectral enhancement of reverberant speech based on inversion of the modulation transfer function. All-pole modeling of modulation spectra of clean and degraded speech are utilized to derive the linear prediction inverse modulation transfer function (LP-IMTF) solution as a low-order IIR filter in the modulation envelope domain. By considering spectral estimation under speech presence uncertainty, speech presence probabilities are derived for the case of reverberation. Aside from enhancement, the LP-IMTF framework allows for blind estimation of reverberation time by extracting a minimum phase approximation of the short-time spectral channel impulse response. The proposed speech enhancement method is used as a front-end processing step for speaker recognition . When applied to the microphone condition of the NISTSRE 2010 with artificially added reverberation, the proposed spectral enhancement method yields significant improvements across a variety of performance metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristics

This paper proposes two methods for robust automatic speech recognition (ASR) in reverberant environments. Unlike other methods which mostly apply inverse filtering by blindly estimated room impulse responses to achieve dereverberation, the proposed methods are based on the utilization of the characteristics of speech. The first method Harmonicity based Feature Analysis – takes advantage of the...

متن کامل

Improving robustness to compressed speech in speaker recognition

The goal of this paper is to analyze the impact of codecdegraded speech on a state-of-the-art speaker recognition system and propose mitigation techniques. Several acoustic features are analyzed, including the standard Mel filterbank cepstral coefficients (MFCC), as well as the noise-robust medium duration modulation cepstrum (MDMC) and power normalized cepstral coefficients (PNCC), to determin...

متن کامل

Microsoft Word - gillespie-03-icassp

__________________________________________ * Currently affiliated with Microsoft Corporation. ABSTRACT We showed in [1] that penalizing long-term reverberation energy is more effective than maximizing the signal-to-reverberation ratio (SRR) for improving audible quality and automatic speech recognition (ASR) accuracy. Using this knowledge we propose a blind approach to speech dereverberation th...

متن کامل

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation

Previously, a dereverberation method based on generalized spectral subtraction (GSS) using multi-channel least mean-squares (MCLMS) has been proposed. The results of speech recognition experiments showed that this method achieved a significant improvement over conventional methods. In this paper, we apply this method to distant-talking (far-field) speaker recognition. However, for far-field spe...

متن کامل

Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays

In this paper we describe the additive robustness obtained through the combined use of a first acoustic processing step based on a low complexity microphone array, followed by a spectral normalization step. Microphone arrays have shown to provide good results in reducing different sources of acoustic degradation. However, microphone arrays produce linear filtering effects that need to be compen...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Linear prediction modulation filtering for speaker recognition of reverberant speech

نویسندگان

چکیده

منابع مشابه

Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristics

Improving robustness to compressed speech in speaker recognition

Microsoft Word - gillespie-03-icassp

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation

Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays

عنوان ژورنال:

اشتراک گذاری